Feature Selection Method Based on Honeybee-SMOTE for Medical Data Classification
نویسندگان
چکیده
Bio-Medical data analysis has an important role in clinical practices. Usually, bio-medical have complex issues like skeweedness, redundant and irrelevant attributes etc.. Several unrelated features frequently degrade the accuracy of classifier while using with imbalanced datasets. The selection becomes critical this situation. key goal feature is to establish a subspace that maintains even as reducing excessive computational learning cost casting off noise. Appropriate approaches are highly dependent on their ability match issue context uncover fundamental patterns within data. This study’s main construct disease detection model uses hybrid feature-selection strategy based Honeybee-SMOTE classification c4.5 algorithm. empirical results suggested methodology's superiority over competing methods regarding parameter, precision-parameter, recall-parameter, f1-score parameter G-Mean parameter. statistical collected findings demonstrates method outperforms competitive existing state-of-the-art algorithms.
منابع مشابه
FF-Based Feature Selection for Improved Classification of Medical Data
In processing the medical data, choosing the optimal subset of features is important, not only to reduce the processing cost but also to improve the classification performance of the model built from the selected data. Rough Set method has been recognized to be one of the powerful tools in the medical feature selection. However, the high storage space and the time-consuming computation restrict...
متن کاملFeature Selection for Classification Based on Sequential Data
We consider the problem of selecting features from a sequence of transient waveforms, with the goal of improved classification performance. For the example studied here, the waveforms are representative of multi-aspect acoustic scattering from an underwater elastic target. The feature selection is performed via a traditional genetic algorithm (GA), with the principal focus on definition of an a...
متن کاملA Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization
Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...
متن کاملH-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data
Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could...
متن کاملOptimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Informatica
سال: 2023
ISSN: ['0350-5596', '1854-3871']
DOI: https://doi.org/10.31449/inf.v46i9.4098